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Hydantoinase Variants With Impr ved Properties 
And Their Use P r The Production Of Amino Acids 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to hydantoinases. More 
particularly, the present invention involves the discovery of a number of 
modified hydantoinases which exhibit improved enzymatic properties relative 
to previously isolated hydantoinases and their use in whole-cell catalysts to 
manufacture amino acids. 

2. Description of Related Art 

Hydantoin hydrolyzing enzymes, which will be referred here as 
'hydantoinases', comprise a diverse class of enzymes having a wide range of 
specificities and biological functions. Some hydantoinases for example play an 
essential role in the reductive pathway of pyrimidine degradation 
(dihydropyrimidinases, EC 3.5.2.2) whereas others catalyze reactions in the 
purine degradation pathway (allantoinases, EC 3.5.2.5). Despite their 
functional diversity hydantoinases show significant sequence similarities and 
belong to a superfamily of amidohydrolases related to ureases as described by 
Holm, L., and Sander, C. (1997) An evolutionary treasure: Unification of a 
broad set of amidohydrolases related to ureases, Proteins 28:72-82; The 
alignment of sequences from the different hydantoinases was used to identify 
conserved residues that are important for catalytic function as described by 
May, O., Habenicht, A., Mattes, R., Syldatk, C. and Siemann, M. (1998) 
Molecular Evolution of Hydantoinases, Biol Chem. 379:743-747; and Kim, 
G.J. and Kim, H.S. (1998) Identification of the structural similarity in the 
functionally related amidohydrolases acting on the cyclic amide ring. Biochem. 
J. 330:295-302. Despite this knowledge that allows to identify equivalent 
amino acid residues of the different hydantoinases, knowledge about the 
function of other amino acid residues is limited. So far, no X-ray structure of 
hydantoinases was reported. 

An important property of hydantoinases is their enantioselectivity which 
makes them valuable for the production of optically pure D- or L-amino acids. 
A detailed background of hydantoinases is provided in the published doctoral 
thesis of Oliver May entitled "The Hydantoinase from Arthrobacter aurescens 
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DSM 3745 and its Relation to other Hydantoinases" (Institut fuer 
Bioverfahrenstechnik, Lehrstuhl Physiologische Mikrobiologie, Universitaet 
Stuttgart- 1998). 

In view of the importance of hydantoinases to the production of optically 
pure amino acids, there has been a concentrated effort to develop modified 
enzymes which have improved properties with respect to amino acid 
production. As a result of this effort, a number of microorganisms have been 
isolated and identified which produce hydantoinases with desirable enzymatic 
properties. U.S. Pat. No. 5,516,660 discloses microoganisms identified as 
DSM7329 and DSM 7330 which produce hydantoinases that are capable of 
producing L-alpha-amino acids from D-, L- and/or D,L-5-monosubstituted 
hydantoins. In U.S. Pat. No. 5,714,355, a mutant of the DSM 7330 
microorganism is disclosed which has greater enzymatic activity than the 
parent organism by a factor of up to 2.7. The mutant (DSM 9771) was 
obtained by cultivating the parent DSM 7330 organism under selective 
pressure using L-carbamoylmethionine (L-CAM) as the sole source of nitrogen. 
Although the hydantoinases produced by the above-mentioned microoganisms 
are well-suited for at least some of their intended purposes, there still is a 
continuing need to develop new enzymes which exhibit even more desirable 
hydantoinase activity. In particular, there is a need to improve 
enantioselectivity as well as catalytic activity. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, modified hydantoinases are 
provided which have enhanced enzymatic properties (better whole-cell 
catalysts) with respect to the hydantoinase produced by the microorganism 
DSM 9771 which is identified in U.S. Pat. No. 5,714,355. The DSM 9771 
hydantoinase has an amino acid sequence which includes numbered positions 
ranging sequentially from 1 to 458 (SEQ. ID. NO. 2). 

It was discovered that substitution of amino acids at one or more 
specific amino acid positions within the DSM 9771 enzyme resulted in the 
formation of enzymes having enhanced properties with respect to activity and 
enantioselectivity. The specific amino acid position numbers at which 
substitutions are made to achieve the modified hydantoinase enzymes in 
accordance with the present invention are positions Nos. 95, 154, 180, 251 
and 255. As a further feature of the invention, specific amino acid 
substitutions at the various positions are identified to provide specific types of 
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modified hydantoinases. The specific amino acid substitutions include I95F, 
I95L, V154A, V180A, Q251R and V255A. One or more of these specific 
substitutions were found to enhance the enzymatic activity and change the 
enantioselectivity of the "wild type" DSM 9771 hydantoinase. These changed 
enzyme properties were found to contribute to a significantly improved 
hydantoinase process by reducing the accumulation of the wrong enantiomer 
of the N-carbamoyl-amino acid. 

Six specific modified hydantoinases are disclosed which have one or 
more of the above amino acid substitutions. The amino acid sequences for 
these modified hydantoinases are set forth in SEQ. ID. NOS. 4, 6, 8, 10, 12 
and 14. These modified hydantoinases are also identified throughout the 
specification as 1CG7, 11DH7, 1BF7, 19AG11, 22CG2 and Q2H4, respectively. 

It was further discovered that hydantoinases evolved for activity and/or 
enantioselectivity can dramatically improve the production of amino acids (i.e., 
L-methionine) using a whole cell catalyst comprising an evolved hydantoinase 
in addition to at least a carbamoylase. 

The above discussed and many other features and attendant 
advantages of the present invention will become better understood by reference 
to the following detailed description when taken in conjunction with the 
accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is the restriction map of the vector used for expression of the 
hydantoinase gene (hyuH) from Arthrobacter sp. DSM9771. 

FIG. 2 is a chromatogram showing the results of the separation of N- 
carbomoyl-methionine enantiomers produced by mutants Q2H4, 11DH7 and 
1CG7 as compared to the DSM 9771 hydantoinase. 

FIG. 3 is a chart which depicts the relative improvements in enzymatic 
activity and enantioselectivity which is provided by the mutant enzymes of the 
present invention with respect to the DSM 9771 hydantoinase (W). 

FIG. 4 is a chart which shows enzymatic activity of different 
Arthrobacter strains and mutant Q2H4 of the present invention. 
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FIG. 5 is two charts showing the time course of the hydrolysis of 100 
mM D,L-MTEH in 0.1M Tris pH 7.8, 37°C with 8mg cell dry mass of A) Kcoli 
JM109 (pOM20/pOM21) (wildtype pathway) and B) Kcoli JM109 
(pOM22/pOM21) (pathway with evolved hydantoinase from Q2H4). 

FIG. 6 is a chart showing a comparison of the hydrolysis of 100 mM 
D,L-MTEH in 0.9% (w/v) NaCl, ImM MnCl 2 , pH 7.8 with 25g/l Kcoli JM109 
(pOM20, pOM21) and E.coli JM109 (pOM21, pOM23) after 1 hour reaction 
time. 



DETAILED DESCRIPTION OF THE INVENTION 
The modified hydantoinases in accordance with the present invention 
were produced, identified and isolated using random mutagenesis procedures 
of the type described in U.S. Pat. Nos. 5,316935 and 5,906,930. Random 
mutagenesis protocols, which are also known as directed evolution procedures, 
are also described in Kuchner, O., Arnold, F.H. (1997) Directed Evolution of 
Enzyme Catalysts, TIBTECH 15:523-530; Chen, K. and Arnold F. (1991). 
En2yme engineering for nonaqueous solvents — random mutagenesis to 
enhance activity of subtilisin E in polar organic media, Bio/ Technology 9:1073- 
1077; Chen, K. and Arnold, F. (1993) Tuning the activity of an enzyme for 
unusual environments: sequential random mutagenesis of subtilisin E for 
catalysis in dimethylformamide, Proc. Natl Acad. Sci. USA 90:5618-5622; and 
You, L. and Arnold, F.H. (1996). Directed Evolution of Subtilisin E in Bacillus 
Subtilis to Enhance Total Activity in Aqueous Dimethylformamide, Protein 
Engineering, 9, 77-83. 

The random mutagenesis procedure used to identify and isolate the 
modified hydantoinases followed the same basic procedures as identified 
above. First, a large number of random mutations in the wild type nucleotide 
sequence (SEQ. ID. NO. 1) were generated. This library of nucleotide 
sequences where then used to express a large number of mutated enzymes. 
The library of mutated hydantoinases was then screened to identify those 
mutants with enhanced enzymatic activity and changed enantioselectivity. 

The step of screening the first library of expressed amino acid 
sequences to identify desirable variants could have been accomplished using 
any number of suitable screening techniques which measure desirable enzyme 
properties. The screening method actually used was a pH-indicator assay 
which will be described in more detail below. 
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In accordance with the present invention, four enzymes having 
enhanced hydantoinase properties were identified as the result of the first 
round of random mutagenesis of the DSM 971 1 nucleotide sequence (SEQ. ID. 
NO. 1). The first round mutant en2ymes are 1CG7, 11DH7, 1BF7 and 
19AG11.. The nucleotide sequences for these first round mutants are set forth 
in SEQ. ID. NOS. 3, 5, 7 and 9, respectively. The corresponding amino acid 
sequences are set forth in SEQ. ID. NOS. 4, 6, 8 and 10, respectively. 

A second round of random mutagenesis was conducted in which the 
1 1DH7 nucleotide sequence was randomly mutated to form a second library of 
mutants. A single mutant (22CG2) was identified which expressed a modified 
hydantoinase that exhibited desirable enzymatic properties. The 22CG2 
enzyme is the same as the 11DH7 enzyme except that the 22CG2 mutant has 
an amino acid substitution at position 180. 

The 22CG2 mutant was subjected to saturation mutagenesis in order to 
introduce all 20 different amino acids into amino acid position 95. 400 clones 
were screened and a mutant enzyme with enhanced enzymatic activity and 
higher (L)-selectivity was identified as Q2H4. The Q2H4 mutant is the same as 
the 22CG2 mutant except that phenylalanine is substituted for isoleucine at 
position 95. 

As a result of the isolation and identification of the above identified 
mutants, it was established that improved hydantoinases may be obtained by 
modifying the DSM 9771 enzyme by substituting amino acids at positions 95, 
124, 154, 180, 251 and 255. The substitutions may be made at one or more of 
the positions. Table 1 sets forth preferred amino acid substitutions. 



Table 1 



Amino Acid Position 


Substitution 


Abbreviation 


95 


He Phe 


I95F 


95 


He -» Leu 


I95L 


154 


Val-* Ala 


V154A 


180 


Val-> Ala 


V180A 


251 


Gin -> Arg 


Q251R 


255 


Val-+Ala 


V255A 
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Amino acid substitutions other than those set forth in Table 1 are 
possible provided that the resulting hydantoinase exhibits desirable enzymatic 
properties. For example, other suitable amino acid substitutions for isoleucine 
at position 95 include Gly, Ala, Val, Leu, Phe, Tyr and Trp. For positions 154, 
180 and 255, suitable alternative amino acid substitutions for valine include 
Ala and Gly. Suitable alternative amino acid substitutions at position 251 for 
glutamine include Arg, Lys and Asn. The amino acid substitutions may be 
made by saturation mutagenesis followed by screening of clones. The 
substitutions may also be made by chemical manipulation of the DSM 9711 
enzyme or by conventional synthesis of peptides having the desired amino acid 
substitutions at the desired locations. It should be noted that the above listed 
amino acid substitutions are intended to be exemplary of preferred alternative 
substitutions at the various substitution sites. Substitutions of other amino 
acids are possible provided that the enzymatic activity of the resulting protein 
is not destroyed. The usefulness of a particular amino acid substitution at 
positions 95, 154, 180, 251 and 255 can be determined by routine pH 
screening as described below. 

The amino acid substitutions described above may be made at 
equivalent positions in other hydantoinases. "Other hydantoinases" refers to 
enzymes that catalyze the hydrolysis of any S'-mono- or disubstituted 
hydantoin derivative to yield the derived N-carbamoyl-amino acid and might 
have between 20 and 100% amino acid sequence identity to the hydantoinase 
from Arthrobacter sp. DSM 9771 which can be determined by sequence align- 
ment algorithm such as BLAST (Altschul, S.F., Gish, W., Miller, W., Myers, 
E.W. & Lipman, D.J. (1990) "Basic local alignment search tool." J. Mol Biol 
215:403-410). 

Amino acid positions are numbered in a linear order (starting at the 
start codon) and not according to their functional and structural context. 
Therefore amino acid residues that contribute in the same way to an enzyme 
function of different hydantoinases do not necessarily have the same amino 
acid position number due to deletion or insertion events in the homologous 
enzyme. "Equivalent positions" of hydantoinases therefore refers to amino acid 
positions that contribute in the same way to a function (activity or enantdo- 
selectivity) as the amino acids identified in our evolution experiment. If the 
amino acid sequence identity of different hydantoinases is high, for example 
higher than 60%, and the amino acid position is located in a conserved region 
without sequence gaps, equivalent positions can be determined by sequence 
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alignment using for example the BLAST algorithm. If the amino acid sequence 
identity is low, for example lower than 60%, and the amino acid position is 
located in a non-conserved region, or near gaps without being surrounded by 
regions of conserved amino acids, other methods such as structure alignments 
can be used if x-ray structures are available (Mizuguchi, K., Go, N., Seeking 
significance in 3-dimensional protein-structure comparisons. Cur. Opin. Struc. 
Biol 5:377-382 (1995)). Here, backbone atoms are structurally aligned and 
equivalent positions can be found based on the relative locations of the amino 
acid residues of the structures. 

An amino acid position that is identified for example by directed 
evolution to contribute to a specific function can often be occupied by different 
amino acid residues, not just the one that was identified by random point 
mutagenesis. Some substitutions will destroy the function, some of them will 
not change the function, and yet others will improve the function. With known 
methods, such as site saturation mutagenesis, one can easily identify amino 
acids that contribute in the same way to a function or even improve it by 
replacing the found amino acid residue with all possible amino acid residues 
(Miyazaki, K., Arnold, F., Exploring nonnatural evolutionary pathways by 
saturation mutagenesis: rapid improvement of protein function. J. AfoZ. Evol 
49:1716-1720). Even non-natural amino acids can be introduced at the 
identified site using a stop codon and a suppresser tRNA linked to a non- 
natural amino acid (Bain, J.D., Glabe, C.G., Dix, T.A., Chamberlain, A.R., 
Diala, E.S., Biosynthetic Site-Specific Incorporation Of A Non-Natural Amino- 
Acid Into A Polypeptide. JACS 111:8013-8014 (1989)). 

Six modified hydantoinases in accordance with the present invention 
are listed in Table 2. Table 2 also lists the amino acid substitutions with 
respect to the DSM 9771 sequence (SEQ. ID. NO. 2) for each modified enzyme 
which is identified. 
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Table 2 



Hydantoinase 


Amino Acid 


Variant 


Substitution 


1CG7 (SEQ. ID. NO. 4) 


V154A 


11DH7 (SEQ. ID. NO. 6) 


I95L+ Q251R 


1BF7 (SEQ. ID. NO. 8) 


V255A 


19AG11 (SEQ. ID. NO. 10) 


I95L 


22CG2 (SEQ. ID. NO. 12) 


I95L + V180A+ Q251R 


Q2H4 (SEQ. ID. NO. 14) 


I95F + V180A+ Q251R 



The modified hydantoinases of the present invention may be used in the 
same manner as other hydantoinases to produce optically pure D- and L- 
amino acids. For example, see Biocatalvtic Production of Amino Acids and 
Derivatives (Rozzell, J.D. and Wagner, F. eds.) (1992) Hanser Publisher, NY, at 
pages 75-176, for a description of the use of hydantoinases in the production 
of optically pure amino acids from DL-5-monosubstituted hydantoins. The 
general use of hydantoinases is also described in Enzyme catalysis in organic 
synthesis (Dranz, K. and Waldmann, H. eds.) 1995, VCH-Verlag, Weinheim, at 
pages 409-431; and Wagner, T. et al. (1996) Production of 1-methionine from 
d,l-5-(2-methylthioethyl) hydantoin by resting cells of a new mutant strain of 
Arthrobacter species DSM 7330, Journal of Biotechnology 46:63-68. 

Amino acids referred to in the present invention are all natural or 
unnatural amino acids, wherein the amino acids are deemed to be a primary 
amine connected to carboxylic acid group via one intermediate C-atom 
(a-C-atom). This C-atom bears only one further residue. Preferred unnatural 
amino acids are disclosed in DE 19903268.8. Preferred natural amino acids 
are those mentioned in Beyer- Walter, Lehrbuch der Organischen Chemie, 22. 
Auflage, S. Hirzel Verlag Stuttgart, S. 822-827. Among those amino acids 
presented above alanine, leucine, isoleucine, methionine, valine, tert.-leucine 
or neopentyl glycine are not preferably transformed in a process utilizing the 
modified hydantoinase. 

To transform hydantoins directly to the amino acids by enzymes it is 
preferred to use a whole-cell catalyst which includes the hydantoinase of the 
invention accompanied with a carbamoylase. A hydantoin racemase can also 
be used in addition to the hydantoinase and carbamoylase. 
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The hydantoinase can be used within this process either in their free or 
immobilized form. Also the carbamoylase and hydantoin racemase may be 
immobilized, too. Techniques to immobilize enzymes are well known to the 
skilled worker. Preferred methods are mentioned in Bhavender P. Sharma, 
Lorraine F. Bailey and Ralph A. Messing, immobilisierte Biomaterialien - 
Techniken und Anwendungen, Angew. Chem. 1992, 94, 836-852; Dordick 
etal., J. Am. Chem. Soc. 194, 116, 5009-5010; Okahata et al., Tetrahedron 
Lett. 1997, 38, 1971-1974; Adlercreutz et al., Biocatalysis 1992, 6, 291-305; 
Goto etal., Biotechnol. Prog. 1994, 10, 263-268; Kamiya etal., Biotechnol. 
Prog. 1995, 11, 270-275; Okahata etal., Tibtech, February 1997, 15, 50-54; 
Fishman et al., Biotechnol. Lett. 1998, 20, 535-538). 

The transformation discussed can be conducted in a batch process or 
continuous manner. Advantageously, an enzyme-membrane-reactor is used as 
the reaction vessel (Wandrey etal. in Jahrbuch 1998, Verfahrenstechnik und 
Chemieingenieurwesen, VDI S. 15 Iff.; Wandrey etal. in Applied Homogeneous 
Catalysis with Organometallic Compounds, Vol. 2, VCH 1996, S.832 ff.; Kragl 
et al., Angew. Chem. 1996, 6, 684f). 

A further embodiment of the present invention is directed to a whole cell 
catalyst comprising a gene encoding for a carbamoylase, an optional racemase 
and a hydantoinase wherein the hydantoinase is considered to be according to 
the modified hydantoinase of the invention. 

Advantageously, a bacteria is used as a cell, because of high 
reproduction rates and easy growing conditions to be applied. There are 
several bacteria known to the skilled worker which can be utilized in this 
respect. Preferably E. coli can be used as the cell and expression system in 
this regard (Yanisch-Perron et al., Gene (1985), 33, 103-109). 
Another aspect of the invention is a process for the production of 
enantiomerically enriched amino acids, which utilizes a whole cell catalyst 
according to the invention. 

It is further preferred in this respect that amino acids like methionine, 
threonine, lysine or tert.-leucine are produced by the aid of the whole cell 
catalyst. 

The transformation discussed in this instance can be conducted in a 
batch process or continuous manner. Advantageously, an enzyme-membrane- 
ractor is used as the reaction vessel (Wandrey et al. in Jahrbuch 1998, 
Verfahrenstechnik und Chemieingenieurwesen, VDI S. 15 Iff.; Wandrey et al. in 
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Applied Homogeneous Catalysis with Organometallic Compounds, Vol. 2, VCH 
1996, S832 ff.; Kragl et al., Angew. Chem. 1996, 6, 684f; DE 19910691.6).' 

There is a further aspect of the invention, which is directed to a process 
for the production of a whole cell catalyst of the invention. The process is 
5 preferably conducted by using expression vectors pOM17, pOM18, pOM20, 

pOM22 and/or pOM21. In addition primers of SEQ. NO. 17, SEQ. NO. 18, 
SEQ. NO. 15 and/or SEQ. NO. 16 are used with regard to the production of the 
whole cell catalyst. 

Examples of Practice are as follows: 

Example 1 

The following example provides additional details regarding the 
procedures used to identify and isolate the modified hydantoinases in 
accordance with the present invention. 

The hydantoinase from Arthrobacter sp. DSM 9771 (U.S. Pat. No. 
5,714,355) was cloned by polymerase chain reaction (PCR). The nucleotide 
sequence was determined and compared to other hydantoinases from closely 
related Arthrobacter strains. The nucleotide and amino acid sequences for the 
hydantoinase are set forth in SEQ. ID. NOS. 1 and 2, respectively. The cloned 
enzymes from Arthrobacter sp. DSM 9771 share about 97.5% identity based on 
their nucleotide sequence (corresponding to 7 amino acid changes) with the 
enzymes from Arthrobacter aurescens DSM 3747 and DSM 3745. The enzymes 
were expressed in £.co/t JM109 using a rhamnose inducible vector construct 
which was provided by the Institute of Industrial Genetics, Universitat 
Stuttgart (Germany). The restriction map is set forth in FIG. 1. 

The hydantoinase was subjected to random mutagenesis using error- 
prone PCR. Approximately 10,000 clones were screened using a pH-indicator 
assay as described below: 

1. Seed culture plates: plates containing 100 jil/well LBamp were 
inoculated with single colonies/ well and incubated for 24 hours at 
30°C, 250 rpm. 

2. Main culture plates: cells from seed culture plates were transferred with 
a 96-pin replicator into plates containing 200 jil LBamp + 0.2% 

35 rhamnose. Plates were incubated for 24 hours at 30°C, 250 rpm. 
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3. Assay: using a pipetting robot, the culture of each well was mixed by 
pipetting up and down (3x) and transferred (75^1 each) into two fresh 
plates. The two plates are filled with 100 nl/well freshly prepared 
substrate solution (80 mM D-MTEH and L-MTEH respectively, in 0.05 
g/1 cresol red pH 8.6). The absorbance at 580 nm is measured 
immediately after the substrate was added to the plate and after 3 
hours incubation at room temperature. The activity was calculated as 
follows: 

Activity = (A580(Oh)-A580(3h))/((A580(Oh)-0.8)) (Rem:0.8 is the absorbance without cells) 

For screening purposes, the ratio of activities for the D- and L-enantiomers is 
taken as an indicator for changed enantioselectivity. 

Since the ratio of activities for different enantiomers in the screening 
tests is only a first hint of enantioselectivity, the identified mutants were 
confirmed by chiral HPLC using the racemic substrate as follows. 2ml 
overnight cultures were added to 2ml 80 mM D L-MTEH in 0.1M Tris pH8.5 
and incubated at 37°C. After lh and 2h respectively, the reaction mixture was 
centrifuged for 2 minutes, 14000 rpm. 20 \d of the supernatant was applied 
onto the HPLC column and the various fractions eluted. 

About 2% of the population showed a significantly higher (>50%) 
activity compared to wild-type DSM 9771. Although a considerable number of 
those clones might be false positives due to common variation of expression 
level in a population, about 50% of rescreened clones were indeed higher 
activity mutants. The high number indicates that hydantoinase has a large 
evolutionary potential (its activity and enantioselectivity can be therefor 
improved). This can be rationalized since a high Km value (about 15mM), a 
rather low specific activity (about 12U/mg) and a low expression level (<10% of 
total protein) leaves room for improvements of this enzyme. 

Table 3 shows the results of the tested mutants. Mutant 1CG7 shows a 
dramatic increase of (D-) selectivity. Compared to wild-type, the enantiomeric 
excess of the product is 4 times increased. The enantioselectivity of clone 
11DH7 and 19 AG 11 was changed into the opposite direction since both 
mutants are absolutely non-selective. The activity mutant 1BF7 possesses the 
same enantioselectivity as wild-type. 
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TABLE 3 


Clone 


Conversion 


enantiomeric 






excess [%] 


wild-type 


42% after 2 hours 


19 


1CG7 


42% after 2 hours 


90 


11DH7 


42% after 2 hours 


0 


19AG11 


37% after 2 hours 


0 


1BF7 


45% after 1 hour 


19 


All of the mutants were sequenced and the nucleotide and amino acid 


sequences established as set forth in Table 4. 






TABLE 4 






Nucleotide Sequence 


Amino Acid Sequence 




(SEO. ID. NO.) 


(SEO. ID. NO.) 


1CG7 


3 


4 


11DH7 


5 


6 


1BF7 


7 


8 


19AG11 


11 


12 



A second round of random mutagenesis was conducted using the first 

10 generation mutant 1 1DH7 as the parent. 

Two different libraries with different error rates (20% and 50% inactive 
clones) were produced and 10,000 clones of each libarry were screened using 
the above-described pH-indicator method. None of the screened clones showed 
significantly higher L-selectivity but mutants with improved activity and higher 

15 D-selectivity were found. One mutant (22CG2) differing in only one amino acid 

(VI 80 A) from the parent was found to be 4-fold more active compared to parent 
11DH7. 

Sequencing of the first generation mutants 11DH7 and 19AG11 
revealed a single mutation (I95L) is responsible for their decreased D- 
20 selectivity. Introducing all 20 different amino acids into amino acid position 95 

of mutant 22CG2 by saturation mutagenesis and screening of about 400 
clones revealed a new mutant (Q2H4) with significantly improved L-selectivity 
(eeL=20%) and 1.5-fold improved activity compared to its parent 22CG2. The 
results of HPLC analysis for enantioselectivity are shown in FIG. 2. The 
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nucleotide and amino acid sequences for 22CG2 are set forth in SEQ. ID. NOS. 
11 and 12, respectively. The nucleotide and amino acid sequences for Q2H4 
are set forth in SEQ. ID. NOS. 13 and 14, respectively. 

In addition to the improvements provided by the mutants described 
5 above, the activity of the whole cell catalyst could be increased by a factor of 10 

by addition of 1 raM manganese to the growth medium and to the substrate 
solution. Under those conditions the activity of mutant 22CG2 was 
determined to be about 380 U/gCDW which is a 50-fold increase in activity 
compared to the activity described for Arthrobacter sp. DSM 9771. A 
10 comparison of the activity of mutant Q2H4 with other strains is given in 

FIG. 4. 

A summary of the enzymatic activities of the various modified enzymes 
with respect to the parent DSM 9771 is set forth in FIG. 3. As can be seen 
from FIG. 3, all of the modified enzymes identified in accordance with the 

15 present invention have activities and/or enantioselectivity which are better 

than the unmodified DSM 9771 hydantoinase. When tested under standard 
conditions by HPLC, the Q2H4 mutant showed inverted enantioselectivity for 
the hydrolysis of D,L-MTEH. Q2H4 produced N-carbamoyl-L-methionine with 
an enantiomeric excess (ee) of 20% at about 30% conversion. In addition, the 

20 Q2H4 mutant was approximately 1 .5-fold more active than its parent 22CG2. 

Example 2 

In a further example, L-methionine was produced with a recombinant 
whole cell catalyst. Recombinant whole cell catalysts were prepared by co- 
25 expressing the evolved or wild-type hydantoinase with a hydantoin racemase 

and a L-carbamoylase in E.coli as follows. 

Strains and expression vectors. The L-carbamoylase and hydantoinase 
expression vector pOM17 and pOM18 were constructed by PCR amplification 

30 of the hyuC and hyuH gene, respectively, from Arthrobacter sp. DSM 9771 

using the following primer: for ftt/uC-amplification: 5'-AGGCGACATA- 
TGACCCTGCAGAAAGCGCAA-3' (SEQ. ID. NO. 17), 5-ATGGGATCCCCGGT- 
CAAGTGCCTTCATTAC-3' (SEQ. ID. NO. 18); for hyuH-amplification: 5'- 
AGAACATATGTTTGACGTAATAGTTAAGAA-3' (SEQ. ID. NO. 15), 5 -AAAAGGAT- 

35 CCTCACTTCGACGCCTCGTA-3' (SEQ. ID. NO. 16). The amplified fragments 

were cleaved with the restriction enzymes Ndel and BamHI and inserted using 
the same restriction enzymes downstream the rha BAD promotor (rhamnose 
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promotor) into the vector pJOE2702 (Volff, J.-N., Eichenseer, C., Viell, P., 
Piendl, W. & Altenbuchner, J. (1996) Nucleotide sequence and role in DNA 
amplification of the direct repeats composing the amplificable element AUDI of 
Streptomyces lividans 66. Mol Microbiol 21, 1037-1047). The co-expression 
plasmid pOM20 comprising the L-carbamoylase and hydantoinase gene, both 
separately under the control of a rhamnose promotor, was derived from 
Plasmid pOM17 and pOM18. pOM17 was digested by Sail and treated with 
the Klenow fragment to form blunt ends. pOM18 was digested by BamHI and 
also treated with the Klenow fragment to form blunt ends. Both fragments 
were subsequently digested from Hindll. The 1521 kb-fragment comprising 
the carbamoylase gene and rhamnose promotor derived from pOM17 was 
ligated with the 5650kb-fragment of the digested pOM18 to yield pOM20. 
Mutations of the L-selective hydantoinase were introduced into pOM20 using 
the restriction enzymes RsrII and KasI which yielded pOM22. The racemase 
expression vector pOM21 was derived from pACYC184 (Rose, R.E. The 
nucleotide sequence of pACYC184. Nucleic Acids Res. 16, 355 (1988)) and 
carries a chloramphenicol selection marker and the racemase gene hyuR from 
Arthrobacter sp. DSM3747 under the control of the rhamnose promotor. All 
plasmids were routinely transformed into E.coli JM109 (Yanisch-Perron, C, 
Viera, J. & Messing, J. (1984) Improved M13 phage cloning vectors and host 
strains: nucleotide sequences of the M13 mpl8 and pUC vectors. Gene 33, 
103-109). The hydantoin converting pathway was installed in E.coli JM109 by 
transformation of pOM20 and pOM22, respectively into E.coli JM109 (pOM21). 
Cells were either grown in LB liquid medium or on LB-agar plates (Luria. S.E., 
Adams, J.N. & Ting, R.C. (1960) Transduction of lactose-utilizing ability among 
strains of Escherichia coli and Shigella dysenteriae and properties of phage 
particles. Virology 12, 348-390), both supplemented with the respective 
antiobiotics for the growth and expression medium (100 ^g/ml ampicillin, 50 
Hg/ml chloramphenicol) and addition of 2 mg/ml rhamnose for the expression 
medium. 

Error-prone PCR, Random mutagenesis of the hydantoinase gene was 
performed in a 100 nl reaction mix containing 0.25 ng of plasmid DNA as 
template, Boehringer PCR buffer (10 mM Tris, 1.5 mM MgCla, 50 mM KCI, pH 
8.3), 200 nM dATP, 200 ^M dTTP, 200 \iM dGTP, 200 \iM dCTP, 50 pmol of 
each primer, and 2.5U Taq polymerase (Boehringer). After 30 cycles, the 1667 
amplification product was extracted from gel using the QiaexII gel-extraction 
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kit (Qiagen, Valencia, CA) and subcloned into vector pJOE2702 using the 
EcoRI and HindUI restriction sites. Religation frequency of alkaline 
phosphatase treated vector was below 1%. 

5 Saturation mutagenesis. For randomization of the codon for amino acid 

position 95, the QuickChange™ protocol (Stratagene, La Jolla, CA) was used. 
About 10 ng plasmid from clone 22CG2 were aplified by PCR using two 
complimentary oligonucleotides (5-CATCGAGATGCCGNNNACCTTCCCG- 
CCCAC-3 , > 5-GTGGGCGGGAAGGTNNNCGGCATCTCGATG-3 1 ). After PCR 
10 amplification the reaction mixture was treated for 2 hours with 20 U of the 

restriction enzyme Dpnl. Transformation of 10 |J Dpnl digested reaction 
mixture into competent cells yielded a library of more than 2000 mutants of 
which about 400 were screened. 

15 Preparation of library and screening. Single colonies of transformed E.coli 

were transferred into 384-well plates (master plates) using the robot system 
Qbot (Genetix, Dorset, UK). After 20 hour growth at 37°C plates were stored at 
-80°C. For subsequent screening, plates were thawed and replicated into 96- 
well plates containing 200 \il per well inductor medium. A Biomek 1000 

20 pipetting workstation (Beckman, Pullerton, CA) was used to divide the 24 

hours at 30°C incubated plate into two fresh 96-well plates one containing 
100 jjQ 80 mM L-MTEH the other 100 ^1 80mM D-MTEH in 50 mg/1 cresol red 
solution adjusted to pH 8.5. Initial absorbance at 580 nm and after 3 hours 
incubation at room temperature were measured using a THERMOmax plate 

25 reader (Molecular Devices, Sunnyvale, CA). Activity was calculated from the 

difference of initial and absorbance after 3 hours incubation divided by the cell 
density of each well. For the saturation mutagenesis library incubation time 
was reduced to 1.5 hours. The ratio of activity towards the L- and D- 
enantiomer was taken as a first indicator for enantioselectivity. Identified 

30 clones were then tested using the racemic substrate under conditions 

described below. 

Characterization of activity and enantioselectivity. Plasmid of mutant 
found to be positive in the screen was sequenced and retransformed into E.coli. 
35 A culture of retransformed E.coli was grown for 16-18 hours (until OD10) in 

inductor medium supplemented with ImM MnCk. 2 ml substrate solution 
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consisting of 80 mM D,L-MTEH, 0.1M Tris pH 8.5, 1 mM MnCl 2 (pre-incubated 
at 37°C) were added to 2 ml cell culture (OD600-7). The reaction mixture was 
immediately incubated at 37°C in a water bath. After different time periods (as 
specified in the text) 1 ml samples were taken and centrifuged for 5 minutes at 
14,000 rpm. 20 \xl of supernatant were analyzed by chiral HPLC using a 
column manufactured by Degussa-Huels AG. Activity was calculated from the 
amount of produced N-carbamoyl-D,L-methionine and expressed as U/ml cell 
culture of U/mg cell dry weight (CDW) were 1U is the amount of whole-cell 
catalyst to produce 1 yunol N-carbamoyl-D,L-methionine in one minute under 
stated reaction conditions. Enantioselectivity of the hydantoinase and its 
mutants were compared by calculating the percentage of ee D ((D-L)/(D+L)) and 
eet ((L-D)/(L+D) respectively for the product at various extents of conversion. A 
conventional determination of E (enantiomeric ratio) from ee-values and the 
extent of conversion as described by Chen et al. (Chen, C.S., Fujimoto, Y., 
Girdaukas, G. & Sih, C.J. (1982) Quantitative analysis of biochemical kinetic 
resolutions of enantiomers. J. Am. Chem. Soc. 104, 7294-7299) is not possible 
because of the fast racemization of the substrate. 

Conversion of D,L-MTEH into L-met. 8 mg cell dry mass of E.coli JM109 
(pOM20 & pOM21) and E.coli JM109 (pOM 22 & pOM21) were added to 4 ml 
100 mM D,L-MTEH in 0.1 M Tris pH 7.8 supplemented with ImM MnCl 2 . The 
reaction mixture was incubated at 37°C. Samples were analyzed after 
indicated periods of time and analyzed by HPLC for MTEH, D,L-C met, and 
D,L-met as described in Volkel, D. & Wagner, F. Reaction (1995) mechanism 
for the conversion of 5-monosubstituted hydantoins to enantiomerically pure 
L-amino acids. Ann. NY Acad. Sci. 750, 1-9. The optical purity of the 
compounds was analyzed by chiral HPLC as described above. 

As shown in FIG. 5, the conversion of D,L-MTEH into L-met is 
significantly improved for the catalyst with the evolved hydantoinase. After 
three hours, approximately 60 mM L-met was produced from 100 mM D,L- 
MTEH, whereas the whole cell catalyst with the wild- type pathway produced 
only 10 mM of the amino acid. The concentration of the D-C met intermediate 
was reduced by a factor of 4 and the productivity for the production of L-amino 
acid was 8-fold increased during the first hour of the reaction. 
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Example 3 

The following example shows that production of L-methionine was 
significantly improved with an evolved hydantoinase of mutant 22CG2 which 
has improved activity and is not enantioselective (0% enantiomeric excess at 
42% conversion, see Table 3). Mutations of the evolved hydantoinase from 
mutant 22CG2 were introduced into pOM20 as previously described using the 
restriction enzymes Rsrll and Kasl, which yielded pOM23. This co-expression 
vector was transformed into E.coli JM109 (pOM21). The resulting whole cell 
catalyst E.coli JM109 (pOM21/pOM23) and E.coli JM109 (pOM21/pOM20) 
were used for conversion of D,L-MTEH into methionine. 125 mg cell dry mass 
of the respective cells were added to 5 ml substrate solution (100 mM D,L- 
MTEH in 0.9% NaCl, ImM MnCl 2 , pH 7.8) and incubated for 1 hour at 37°C. 
FIG. 6 shows the production of methionine (Met) and N-carbamoyl-methionine 
(C-Met) from D,L-MTEH for both catalysts. The whole cell catalyst with the 
improved hydantoinase from clone 22CG2 produced about 65 mM methionine 
within one hour whereas whole cell catalyst with the wild-type hydantoinase 
produced only 8 mM during the same reaction time. This demonstrates that 
an evolved hydantoinase without significant enantioselectivity but improved 
activity leads to a significant improvement for the production of methionine. 

Having thus described exemplary embodiments of the present 
invention, it should be noted by those skilled in the art that the within 
disclosures are exemplary only and that various other alternatives, 
adaptations, and modifications may be made within the scope of the present 
invention. Accordingly, the present invention is not limited to the specific 
embodiments as illustrated herein, but is only limited by the following claims. 
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CLAIMS 

What is claimed is : 

1. A modified hydantoinase having improved enzymatic properties 
relative to unmodified hydantoinase of SEQ. ID. NO: 1, wherein said modified 
hydantoinase consists of unmodified hydantoinase which has been modified by 
an amino acid substitution at one or more amino acid positions selected from 
the group consisting of amino acid position numbers 95, 154, 180, 251 and 
255 and equivalent positions of other hydantoinases. 

2. A modified hydantoinase according to claim 1 wherein said one 
or more amino acid substitutions are selected from the group consisting of 
I95F, I95L, V154A, V180A, Q251R and V255A. 

3. A modified hydantoinase according to claim 1 wherein said 
unmodified hydantoinase is modified by the amino acid substitutions I95F, 
V180A and Q251R. 

4. A modified hydantoinase according to claim 1 wherein said 
unmodified hydantoinase is modified by the amino acid substitutions V180A 
and Q251R. 

5. A modified hydantoinase according to claim 1 wherein said 
unmodified hydantoinase is modified by the amino acid substitutions I95L and 
Q251R. 

6. A modified hydantoinase according to claim 1 wherein said 
unmodified hydantoinase is modified by the amino acid substitution V154A. 

7. A modified hydantoinase according to claim 1 wherein said 
unmodified hydantoinase is modified by the amino acid substitution V255A. 

9. A modified hydantoinase according to claim 1 wherein said 
unmodified hydantoinase is modified by the amino acid substitution I95L. 

10. A nucleic acid segment comprising a region which encodes a 
modified hydantoinase having improved enzymatic properties relative to 
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unmodified hydantoinase of SEQ. ID. NO. 1, wherein said modified 
hydantoinase consists of unmodified hydantoinase which has been modified by 
an amino acid substitution at one or more amino acid positions selected from 
the group consisting of amino acid position numbers 95, 154, 180, 251 and 
255 and equivalent positions of other hydantoinases. 

11. A nucleic acid segment according to claim 10 wherein said one 
or more amino acid substitutions are selected from the group consisting of 
I95F, I95L, V154A, V180A, Q251R and V255A. 

12. An expression vector containing a nucleic acid segment 
according to claim 10. 

13. An expression vector containing a nucleic acid segment 
according to claim 11. 

14. A host cell comprising a nucleic acid segment according to 
claim 10. 

15. A host cell comprising a nucleic acid segment according to 
claim 11. 

16. A method for producing a modified hydantionase wherein said 
method comprises the steps of: 

expressing a nucleic acid segment encoding said modified 
hydantoinase according to claim 10 in a suitable host cell to produce modified 
hydantoinase; and 

recovering said modified hydantoinase produced by said 
expression step. 

17. A method for producing a modified hydantionase wherein said 
method comprises the steps of: 

expressing a nucleic acid segment encoding said modified 
hydantoinase according to claim 11 in a suitable host cell to produce modified 
hydantoinase; and 

recovering said modified hydantoinase produced by said 
expression step. 
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18. Process for the production of amino acids utilizing a modified 
hydantoinase according to claim 1, 

19. Process according to claim 18 characterized in that a modified 
hydantoinase according to claim 2 is utilized. 

20. Process according to claim 18 characterized in that accompanied 
with the hydantoinase a carbamoylase and a hydantoin racemase is used. 

21. Process according to claim 18 characterized in that free or 
immobilized hydantoinase is used. 

22. Process according to claim 18 characterized in that alanine, 
leucine, isoleucine, methionine, valine or tert. -leucine or neopentyl glycine are 
produced. 

23. Process according to claim 18 characterized in that the process 
is performed in an enzyme-membrane-reactor. 

24. Whole cell catalyst comprising a gene encoding for a 
hydantoinase wherein the hydantoinase is considered to be according to 
claim 1. 

25. A whole cell catalyst according to claim 24 which further 
comprises a gene encoding for a carbamoylase. 

26. A whole cell catalyst according to claim 25 which further 
comprises a gene encoding for a racemase. 

27. A whole cell catalyst according to claim 24 characterized in that 
a bacteria is used as a whole cell. 

28. A whole cell catalyst according to claims 24 characterized in that 
Escherichia coli is used as a whole cell. 
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29. Process for the production of amino acids utilizing whole cell 
catalysts according to claim 24. 

30. Process according to claim 27 characterized in that methionine, 
valine, threonine, lysine or tert.-leucine is produced. 

31. Process according to claims 27 characterized in that the process 
is performed in an enzyme-membrane-reactor. 

32. Process for the production of a whole cell catalyst according to 
claim 24, wherein expression vectors pOM17, pOM18, pOM20, pOM22 and/or 
pOM21 are used. 

33. Process according to claim 32 wherein primers of SEQ. NO. 17, 
SEQ. NO. 18, SEQ. NO. 15 and/or SEQ. NO. 16 are used. 

34. A process for production of alpha-amino acids using a whole cell 
catalyst containing an improved hydandoinase of claim 1 where the N- 
carbamoyl-D-amino acid or N-carbamoyl-L-amino acid that is produced by the 
action of the hydantoinase is further chemically or enzymatically hydrolyzed to 
produce the amino acid and where the racemization of the hydantoin is 
achieved either chemically or enzymatically. 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/126,923 

(B) FILING DATE: March 29, 1999 

(A) APPLICATION NUMBER: 60/157,427 

(B) FILING DATE: October 4, 1999 

(A) APPLICATION NUMBER: 09/497,585 

(B) FILING DATE: February 3, 2000 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Oldenkamp, David J. 

(B) REGISTRATION NUMBER: 29,421 

(C) REFERENCE/DOCKET NUMBER: 330187-93 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (310) 788-5000 

(B) TELEFAX: f 310) 277-1297 

(2) INFORMATION FOR SEQ ID NO: 1: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: nucleic acid 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



atgtttgacg taatagttaa gaactgccgt atggtgtcca gcgacggaat caccgaggca 60 

gacattctgg tgaaagacgg caaagtcgcc gcaatcagct cggacacaag tgatgttgag 120 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 180 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 24 0 

tctgcggccg tgggaggcat caccaccatc atcgagatgc cgataacctt cccgcccacc 300 

accactttgg acgccttcct cgaaaagaag aagcaggcgg ggcagcggtt gaaagttgac 360 

ttcgcgctct atggcggtgg agtgccggga aacctgcccg agatccgcaa aatgcacgac 420 

gccggcgcag tgggcttcaa gtcaatgatg gcagcctcag ttccgggcat gttcgacgcc 4 80 

gtcagcgacg gcgaactgtt cgaaatcttc caggagatcg cagcctgtgg ttcagtcgtc 540 

gtggtccatg ccgagaatga aacgatcatt caagcgctcc agaagcagat caaagccgct 600 

ggtcgcaagg acatggccgc ctacgaggca tcccaaccag ttttccagga gaacgaggcc 660 

attcagcgtg cgttactact gcagaaagaa gccggctgtc gactgattgt gcttcacgtg 720 

agcaaccctg acggggtcga gctgatacat caggcgcaat ccgagggcca ggacgtccac 780 

tgcgagtcgg gtccgcagta tctgaatatc accacggacg acgccgaacg aatcggaccg 840 

tatatgaagg tcgcgccgcc cgtccgctca gccgagatga acgtcagatt atgggaacaa 900 

cttgagaacg ggctcatcga cacccttggg tcagaccacg gcggacatcc tgtcgaggac 960 

aaagaacccg gctggaagga cgtgtggaaa gccggcaacg gtgcgctggg ccttgagaca 1020 

tccctgccta tgatgctgac caacggagtg aataaaggca ggctatcctt ggaacgcctc 1080 

gtcgaggtga tgtgcgagaa acctgcgaag ctctttggca tctatccgca gaagggcacg 1140 

ctacaggttg gttccgacgc cgatctgctc atcctcgatc tggatattga caccaaagtg 1200 

gatgcctcgc agttccgatc cctgcataag tacagcccgt tcgacgggat gcccgtcacg 1260 

ggtgcaccgg ttctgacgat ggtgcgcgga acggtggtgg cagagaaggg agaagttctg 1320 

gtcgagcagg gattcggcca gttcgtcacc cgtcacgact acgaggcgtc gaagtga 1377 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(di) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Met Phe Asp Val lie Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 10 15 

He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 

Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 
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Ser Ala Ala Val Gly Gly He Thr Thr He He Glu Met Pro He Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 HO 

Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
US 120 125 

Pro Gly Asn Leu Pro Glu lie Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala 
145 150 155 160 

Val Ser Asp Gly Glu Leu Phe Glu He Phe Gin Glu He Ala Ala Cys 
165 170 175 

Gly Ser Val Val Val Val His Ala Glu Asn Glu Thr He He Gin Ala 
180 185 190 

Leu Gin Lys Gin He Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala He Gin Arg Ala 
210 215 220 

Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 

Ser Asn Pro Asp Gly Val Glu Leu He His Gin Ala Gin Ser Glu Gly 
245 250 255 

Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn He Thr Thr 
260 265 270 

Asp Asp Ala Glu Arg He Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 

Arg Ser Ala Glu Met Asn Val Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 

Leu He Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp 
305 310 315 320 

Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 

Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
340 345 350 

Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 



Ala Lys Leu Phe Glu He Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 
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Ser Asp Ala Asp Leu Leu lie Leu 
385 390 

Asp Ala Ser Gin Phe Arg Ser Leu 
405 

Met Pro Val Thr Gly Ala Pro Val 
420 

Val Ala Glu Lys Gly Glu Val Leu 
435 440 

Val Thr Arg His Asp Tyr Glu Ala 
450 455 



Asp Leu Asp lie Asp Thr Lys Val 
395 400 

His Lys Tyr Ser Pro Phe Asp Gly 
410 415 

Leu Thr Met Val Arg Gly Thr Val 
425 430 

Val Glu Gin Gly Phe Gly Gin Phe 
445 

Ser Lys 
458 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



atgtttgacg taatagttaa gaactgccgt atggtgtcca gcgacggaat caccgaggca 60 

gacattctgg tgaaagacgg caaagtcgcc gcaatcagct cggacacaag tgatgttgag 120 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 180 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 240 

tctgcggccg tgggaggcat caccaccatc atcgagatgc cgataacctt cccgcccacc 300 

accactttgg acgccttcct cgaaaagaag aagcaggcgg ggcagcggtt gaaagttgac 360 

ttcgcgctct atggcggtgg agtgccggga aacctgcccg agatccgcaa aatgcacgac 420 

gccggcgcag tgggcttcaa gtcaatgatg gcagcctcag ctccgggcat gttcgacgcc 480 

gtcagcgacg gcgaactgtt cgaaatcttc caggagatcg cagcctgtgg ttcagtcgtc 540 

gtggtccatg ccgagaatga aacgatcatt caagcgctcc agaagcagat caaagccgct 600 

ggtcgcaagg acatggccgc ctacgaggca tcccaaccag ttttccagga gaacgaggcc 660 

attcagcgtg cgttactact gcagaaagaa gccggctgtc gactgattgt gcttcacgtg 720 

agcaaccctg acggggtcga gctgatacat caggcgcaat ccgagggcca ggacgtccac 780 

tgcgagtcgg gtccgcagta tctgaatatc accacggacg acgccgaacg aatcggaccg 840 

tatatgaagg tcgcgccgcc cgtccgctca gccgagatga acgtcagatt atgggaacaa 900 

cttgagaacg ggctcatcga cacccttggg tcagaccacg gcggacatcc tgtcgaggac 960 

aaagaacccg gctggaagga cgtgtggaaa gccggcaacg gtgcgctggg ccttgagaca 1020 

tccctgccta tgatgctgac caacggagtg aataaaggca ggctatcctt ggaacgcctc 1080 

gtcgaggtga tgtgcgagaa acctgcgaag ctctttggca tctatccgca gaagggcacg 1140 

ctacaggttg gttccgacgc cgatctgctc atcctcgatc tggatattga caccaaagtg 1200 

gatgcctcgc agttccgatc cctgcataag tacagcccgt tcgacgggat gcccgtcacg 1260 

ggtgcaccgg ttctgacgat ggtgcgcgga acggtggtgg cagagaaggg agaagttctg 1320 

gtcgagcagg gattcggcca gttcgtcacc cgtcacgact acgaggcgtc gaagtga 1377 



WO 00/58449 PCT/US00/08159 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Met Phe Asp Val He Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 10 15 

He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 

Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 

Ser Ala Ala Val Gly Gly He Thr Thr He He Glu Met Pro He Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 HO 

Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
115 120 125 

Pro Gly Asn Leu Pro Glu He Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Ala Pro Gly Met Phe Asp Ala 
145 150 155 160 

Val Ser Asp Gly Glu Leu Phe Glu He Phe Gin Glu He Ala Ala Cys 
165 170 175 

Gly Ser Val Val Val Val His Ala Glu Asn Glu Thr He He Gin Ala 
180 185 190 

Leu Gin Lys Gin He Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala He Gin Arg Ala 
210 215 220 



Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 
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Ser Asn Pro Asp Gly Val Glu Leu lie His Gin Ala Gin Ser Glu Gly 
245 250 255 

Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn He Thr Thr 
260 265 270 

Asp Asp Ala Glu Arg He Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 

Arg Ser Ala Glu Met Asn Val Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 

Leu He Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp 
305 310 315 320 

Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 

Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
3^0 345 35 0 

Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 

Ala Lys Leu Phe Glu He Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 

Ser Asp Ala Asp Leu Leu He Leu Asp Leu Asp He Asp Thr Lys Val 
385 390 395 400 

Asp Ala Ser Gin Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly 
405 410 415 

Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val 
420 425 430 

Val Ala Glu Lys Gly Glu Val Leu Val Glu Gin Gly Phe Gly Gin Phe 
435 440 445 



Val Thr Arg His Asp Tyr Glu Ala Ser Lys 
450 455 458 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: nucleic acid 



WO 00/58449 



7 



PCT/US00/08159 



(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



atgtttgacg 


taatagttaa 


gaactgccgt 


atggtgtcca 


gcgacggaat 


caccgaggca 


60 


gacattctgg 


tgaaagacgg 


caaagtcgcc 


gcaatcagct 


cggacacaag 


tgatgttgag 


120 


gcgagccgaa 


ccat tgacgc 


gggtggcaag 


ttcgtgatgc 


cgggcgtggt 


cgatgaacat 


180 


gcgagccgaa 


ccattgacgc 


gggtggcaag 


ttcgtgatgc 


cgggcgtggt 


cgatgaacat 


240 


tctgcggccg 


tgggaggcat 


caccaccatc 


atcgagatgc 


cgttaacctt 


cccgcccacc 


300 


accactttgg 


acgccttcct 


cgaaaagaag 


aagcaggcgg 


ggcagcggtt 


gaaagttgac 


360 


ttcgcgctct 


atggcggtgg 


agtgccggga 


aacctgcccg 


agatccgcaa 


aatgcacgac 


420 


gccggcgcag 


tgggcttcaa 


gtcaatgatg 


gcagcctcag 


ttccgggcat 


gttcgacgcc 


480 


gtcagcgacg 


gcgaactgtt 


cgaaatcttc 


caggagatcg 


cagcctgtgg 


ttcagtcgtc 


540 


gtggtccatg 


ccgagaatga 


aacgatcatt 


caagcgctcc 


agaagcagat 


caaagccgct 


600 


ggtcgcaagg 


acatggccgc 


ctacgaggca 


tcccaaccag 


ttttccagga 


gaacgaggcc 


660 


attcagcgtg 


cgttactact 


gcagaaagaa 


gccggctgtc 


gactgattgt 


gcttcacgtg 


720 


agcaaccctg 


acggggtcga 


gctgatacat 


caggcgcaat 


ccgagggcca 


ggacgtccac 


780 


tgcgagtcgg 


gtccgcagta 


tctgaatatc 


accacggacg 


acgccgaacg 


aatcggaccg 


840 


tatatgaagg 


tcgcgccgcc 


cgtccgctca 


gccgagatga 


acgtcagatt 


atgggaacaa 


900 


cttgagaacg 


ggctcatcga 


cacccttggg 


tcagaccacg 


gcggacatcc 


tgtcgaggac 


960 


aaagaacccg 


gctggaagga 


cgtgtggaaa 


gccggcaacg 


gtgcgctggg 


ccttgagaca 


1020 


tccctgccta 


tgatgctgac 


caacggagtg 


aataaaggca 


ggctatcctt 


ggaacgcctc 


1080 


gtcgaggtga 


tgtgcgagaa 


acctgcgaag 


ctctttggca 


tctatccgca 


gaagggcacg 


1140 


ctacaggttg 


gttccgacgc 


cgatctgctc 


atcctcgatc 


tggatattga 


caccaaagtg 


1200 


gatgcctcgc 


agttccgatc 


cctgcataag 


tacagcccgt 


tcgacgggat 


gcccgtcacg 


1260 


ggtgcaccgg 


ttctgacgat 


ggtgcgcgga 


acggtggtgg 


cagagaaggg 


agaagttctg 


1320 


gtcgagcagg 


gattcggcca 


gttcgtcacc 


cgtcacgact 


acgaggcgtc 


gaagtga 


1377 



(2) INFORMATION FOR SEQ ID NO: 6: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Met Phe Asp Val He Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 10 15 

He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 

Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 

Ser Ala Ala Val Gly Gly He Thr Thr He He Glu Met Pro Leu Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 110 

Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
115 120 125 

Pro Gly Asn Leu Pro Glu He Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala 
145 150 155 160 

Val Ser Asp Gly Glu Leu Phe Glu lie Phe Gin Glu He Ala Ala Cys 
165 170 175 

Gly Ser Val Val Val Val His Ala Glu Asn Glu Thr He lie Gin Ala 
180 185 190 

Leu Gin Lys Gin He Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala lie Gin Arg Ala 
210 215 220 

Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 

Ser Asn Pro Asp Gly Val Glu Leu lie His Arg Ala Gin Ser Glu Gly 
245 250 255 



Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn lie Thr Thr 
260 265 270 
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Asp Asp Ala Glu Arg lie Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 

Arg Ser Ala Glu Met Asn Val Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 

Leu He Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp 
305 310 315 320 

Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 

Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
340 345 350 

Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 

Ala Lys Leu Phe Glu He Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 

Ser Asp Ala Asp Leu Leu He Leu Asp Leu Asp He Asp Thr Lys Val 
385 390 395 400 

Asp Ala Ser Gin Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly 
405 410 415 

Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val 
420 425 430 

Val Ala Glu Lys Gly Glu Val Leu Val Glu Gin Gly Phe Gly Gin Phe 
435 440 445 

Val Thr Arg His Asp Tyr Glu Ala Ser Lys 
450 455 458 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

atgtttgacg taatagttaa gaactgccgt atggtgtcca gcgacggaat caccgaggca 60 

gacattctgg tgaaagacgg caaagtcgcc gcaatcagct cggacacaag tgatgttgag 120 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 180 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 24 0 
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tctgcggccg tgggaggcat caccaccatc atcgagatgc cgataacctt cccgcccacc 300 

accactttgg acgccttcct cgaaaagaag aagcaggcgg ggcagcggtt gaaagttgac 360 

ttcgcgctct atggcggtgg agtgccggga aacctgcccg agatccgcaa aatgcacgac 420 

gccggcgcag tgggcttcaa gtcaatgatg gcagcctcag ttccgggcat gttcgacgcc 480 

gtcagcgacg gcgaactgtt cgaaatcttc caggagatcg cagcctgtgg ttcagtcgtc 54 0 

gtggtccatg ccgagaatga aacgatcatt caagcgctcc agaagcagat caaagccgct 600 

ggtcgcaagg acatggccgc ctacgaggca tcccaaccag ttttccagga gaacgaggcc 660 

attcagcgtg cgttactact gcagaaagaa gccggctgtc gactgattgt gcttcacgtg 720 

agcaaccctg acggggtcga gctgatacat caggcgcaat ccgagggcca ggacgtccac . 780 

tgcgagtcgg gtccgcagta tctgaatatc accacggacg acgccgaacg aatcggaccg 840 

tatatgaagg tcgcgccgcc cgtccgctca gccgagatga acgccagatt atgggaacaa 900 

cttgagaacg ggctcatcga cacccttggg tcagaccacg gcggacatcc tgtcgaggac 960 

aaagaacccg gctggaagga cgtgtggaaa gccggcaacg gtgcgctggg ccttgagaca 1020 

tccctgccta tgatgctgac caacggagtg aataaaggca ggctatcctt ggaacgcctc 1080 

gtcgaggtga tgtgcgagaa acctgcgaag ctctttggca tctatccgca gaagggcacg 1140 

ctacaggttg gttccgacgc cgatctgctc atcctcgatc tggatattga caccaaagtg 1200 

gatgcctcgc agttccgatc cctgcataag tacagcccgt tcgacgggat gcccgtcacg 1260 

ggtgcaccgg ttctgacgat ggtgcgcgga acggtggtgg cagagaaggg agaagttctg 1320 

gtcgagcagg gattcggcca gttcgtcacc cgtcacgact acgaggcgtc gaagtga 1377 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Met Phe Asp Val He Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 10 15 
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He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 

Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 

Ser Ala Ala Val Gly Gly He Thr Thr lie lie Glu Met Pro He Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 HO 

Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
115 120 125 

Pro Gly Asn Leu Pro Glu He Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala 
145 150 155 160 

Val Ser Asp Gly Glu Leu Phe Glu He Phe Gin Glu He Ala Ala Cys 
165 170 175 

Gly Ser Val Val Val Val His Ala Glu Asn Glu Thr He He Gin Ala 
180 185 190 

Leu Gin Lys Gin He Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala He Gin Arg Ala 
210 215 220 

Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 

Ser Asn Pro Asp Gly Val Glu Leu He His Gin Ala Gin Ser Glu Gly 
245 250 255 

Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn He Thr Thr 
260 265 270 

Asp Asp Ala Glu Arg He Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 



Arg Ser Ala Glu Met Asn Ala Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 



Leu He Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp 
305 310 315 320 
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Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 

Gly Leu Giu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
340 345 350 

Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 

Ala Lys Leu Phe Glu lie Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 

Ser Asp Ala Asp Leu Leu lie Leu Asp Leu Asp lie Asp Thr Lys Val 
385 390 395 400 

Asp Ala Ser Gin Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly 
405 410 415 

Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val 
420 425 430 

Val Ala Glu Lys Gly Glu Val Leu Val Glu Gin Gly Phe Gly Gin Phe 
435 440 445 

Val Thr Arg His Asp Tyr Glu Ala Ser Lys 
450 455 458 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
atgtttgacg taatagttaa gaactgccgt atggtgtcca gcgacggaat caccgaggca 
gacattctgg tgaaagacgg caaagtcgcc gcaatcagct cggacacaag tgatgttgag 
gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 



tctgcggccg tgggaggcat caccaccatc atcgagatgc cgttaacctt cccgcccacc 

accactttgg acgccttcct cgaaaagaag aagcaggcgg ggcagcggtt gaaagttgac 

ttcgcgctct atggcggtgg agtgccggga aacctgcccg agatccgcaa aatgcacgac 

gccggcgcag tgggcttcaa gtcaatgatg gcagcctcag ttccgggcat gttcgacgcc 



60 
120 
180 



gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 240 



300 
360 
420 
480 
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gtcagcgacg gcgaactgtt cgaaatcttc caggagatcg cagcctgtgg ttcagtcgtc 540 

gtggtccatg ccgagaatga aacgatcatt caagcgctcc agaagcagat caaagccgct 600 

ggtcgcaagg acatggccgc ctacgaggca tcccaaccag ttttccagga gaacgaggcc 660 

attcagcgtg cgttactact gcagaaagaa gccggctgtc gactgattgt gcttcacgtg 720 

agcaaccctg acggggtcga gctgatacat caggcgcaat ccgagggcca ggacgtccac 780 

tgcgagtcgg gtccgcagra tctgaatatc accacggacg acgccgaacg aatcggaccg 840 

tatatgaagg tcgcgccgcc cgtccgctca gccgaaatga acgtcagatt atgggaacaa 900 

cttgagaacg ggctcatcga cacccttggg tcagaccacg gcggacatcc tgtcgaggac 960 

aaagaacccg gctggaagga cgtgtggaaa gccggcaacg gtgcgctggg ccttgagaca 1020 

tccctgccta tgatgctgac caacggagtg aataaaggca ggctatcctt ggaacgcctc 1080 

gtcgaggtga tgtgcgagaa acctgcgaag ctctttggca tctatccgca gaagggcacg 1140 

ctacaggttg gttccgacgc cgatctgctc atcctcgatc tggatattga caccaaagtg 1200 

gatgcctcgc agttccgatc cctgcataag tacagcccgt tcgacgggat gcccgtcacg 1260 

ggtgcaccgg ttctgacgat ggtgcgcgga acggtggtgg cagagaaggg agaagttctg 1320 

gtcgagcagg gattcggcca gttcgtcacc cgtcacgact acgaggcgtc gaagtga 1377 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Met Phe Asp Val He Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 10 15 

He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 
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Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 

Ser Ala Ala Val Gly Gly He Thr Thr He He Glu Met Pro Leu Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 110 

Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
115 120 125 

Pro Gly Asn Leu Pro Glu He Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala 
145 150 155 160 

Val Ser Asp Gly Glu Leu Phe Glu He Phe Gin Glu He Ala Ala Cys 
165 170 175 

Gly Ser Val Val Val Val His Ala Glu Asn Glu Thr He lie Gin Ala 
180 185 190 

Leu Gin Lys Gin lie Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala He Gin Arg Ala 
210 215 220 

Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 

Ser Asn Pro Asp Gly Val Glu Leu He His Gin Ala Gin Ser Glu Gly 
245 250 255 

Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn He Thr Thr 
260 265 270 

Asp Asp Ala Glu Arg lie Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 

Arg Ser Ala Glu Met Asn Val Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 

Leu lie Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp 
305 310 315 320 

Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 



Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
340 345 350 



Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 
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Ala Lys Leu Phe Glu He Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 

Ser Asp Ala Asp Leu Leu He Leu Asp Leu Asp He Asp Thr Lys Val 
385 390 395 400 

Asp Ala Ser Gin Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly 
405 410 415 

Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val 
420 425 430 

Val Ala Glu Lys Gly Glu Val Leu Val Glu Gin Gly Phe Gly Gin Phe 
435 440 445 

Val Thr Arg His Asp Tyr Glu Ala Ser Lys 
450 455 458 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: nucleic acid 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



atgtttgacg 


taatagttaa 


gaactgccgt 


atggtgtcca 


gcgacggaat 


caccgaggca 


60 


gacattctgg 


tgaaagacgg 


caaagtcgcc 


gcaatcagct 


cggacacaag 


tgatgttgag 


120 


gcgagccgaa 


ccattgacgc 


gggtggcaag 


ttcgtgatgc 


cgggcgtggt 


cgatgaacat 


180 


gcgagccgaa 


ccattgacgc 


gggtggcaag 


ttcgtgatgc 


cgggcgtggt 


cgatgaacat 


240 


tctgcggccg 


tgggaggcat 


caccaccatc 


atcgagatgc 


cgttaacctt 


cccgcccacc 


300 


accactttgg 


acgccttcct 


cgaaaagaag 


aagcaggcgg 


ggcagcggtt 


gaaagttgac 


360 


ttcgcgctct 


atggcggtgg 


agtgccggga 


aacctgcccg 


agatccgcaa 


aatgcacgac 


420 


gccggcgcag 


tgggcttcaa 


gtcaatgatg 


gcagcctcag 


ttccgggcat 


gttcgacgcc 


480 


gtcagcgacg 


gcgaactgtt 


cgaaatcttc 


caggagatcg 


cagcctgtgg 


ttcagtcgcc 


540 


gtggtccatg 


ccgagaatga 


aacgatcatt 


caagcgctcc 


agaagcagat 


caaagccgct 


600 


ggtcgcaagg 


acatggccgc 


ctacgaggca 


tcccaaccag 


ttttccagga 


gaacgaggcc 


660 


attcagcgtg 


cgttactact 


gcagaaagaa 


gccggctgtc gactgattgt 


gcttcacgtg 


720 
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agcaaccctg acggggtcga gctgatacat cgggcgcaat ccgagggcca ggacgtccac 780 

tgcgagtcgg gtccgcagta tctgaatatc accacggacg acgccgaacg aatcggaccg 840 

tatatgaagg tcgcgccgcc cgtccgctca gccgagatga acgtcagatt atgggaacaa 900 

cttgagaacg ggctcatcga cacccttggg tcagaccacg gcggacatcc tgtcgaggac 960 

aaagaacccg gctggaagga cgtgtggaaa gccggcaacg gtgcgctggg ccttgagaca 1020 

tccctgccta tgatgctgac caacggagtg aataaaggca ggctatcctt ggaacgcctc 1080 

gtcgaggtga tgtgcgagaa acctgcgaag ctctttggca tctatccgca gaagggcacg 1140 

ctacaggttg gttccgacgc cgatctgctc atcctcgatc tggatattga caccaaagtg 1200 

gatgcctcgc agttccgatc cctgcataag tacagcccgt tcgacgggat gcccgtcacg 1260 

ggtgcaccgg ttctgacgat ggtgcgcgga acggtggtgg cagagaaggg agaagttctg 1320 

gtcgagcagg gattcggcca gttcgtcacc cgtcacgact acgaggcgtc gaagtga 1377 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Met Phe Asp Val He Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 .10 15 

He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 

Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 

Ser Ala Ala Val Gly Gly lie Thr Thr He He Glu Met Pro Leu Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 HO 
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Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
115 120 125 

Pro Gly Asn Leu Pro Glu lie Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala 
145 150 155 160 

Val Ser Asp Gly Glu Leu Phe Glu lie Phe Gin Glu lie Ala Ala Cys 
165 170 175 

Gly Ser Val Ala Val Val His Ala Glu Asn Glu Thr He He Gin Ala 
180 185 190 

Leu Gin Lys Gin He Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala He Gin Arg Ala 
210 215 220 

Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 

Ser Asn Pro Asp Gly Val Glu Leu lie His Arg Ala Gin Ser Glu Gly 
245 250 255 

Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn He Thr Thr 
260 265 270 

Asp Asp Ala Glu Arg He Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 

Arg Ser Ala Glu Met Asn Val Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 

Leu He Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val Glu Asp 
305 310 315 320 

Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 

Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
340 345 350 

Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 

Ala Lys Leu Phe Glu He Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 



Ser Asp Ala Asp Leu Leu He Leu Asp Leu Asp He Asp Thr Lys Val 
385 390 395 400 



Asp Ala Ser Gin Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly 
405 410 415 
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Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val 
420 425 430 

Val Ala Glu Lys Gly Glu Val Leu Val Glu Gin Gly Phe Gly Gin Phe 
435 440 445 

Val Thr Arg His Asp Tyr Glu Ala Ser Lys 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: nucleic acid 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

atgtttgacg taatagttaa gaactgccgt atggtgtcca gcgacggaat caccgaggca 60 

gacattctgg tgaaagacgg caaagtcgcc gcaatcagct cggacacaag tgatgttgag 120 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 180 

gcgagccgaa ccattgacgc gggtggcaag ttcgtgatgc cgggcgtggt cgatgaacat 240 

tctgcggccg tgggaggcat caccaccatc atcgagatgc cgtttacctt cccgcccacc 300 

accactttgg acgccttcct cgaaaagaag aagcaggcgg ggcagcggtt gaaagttgac 360 

ttcgcgctct atggcggtgg agtgccggga aacctgcccg agatccgcaa aatgcacgac 420 

gccggcgcag tgggcttcaa gtcaatgatg gcagcctcag ttccgggcat gttcgacgcc 4 80 

gtcagcgacg gcgaactgtt cgaaatcttc caggagatcg cagcctgtgg ttcagtcgcc 540 

gtggtccatg ccgagaatga aacgatcatt caagcgctcc agaagcagat caaagccgct 600 

ggtcgcaagg acatggccgc ctacgaggca tcccaaccag ttttccagga gaacgaggcc 660 

attcagcgtg cgttactact gcagaaagaa gccggctgtc gactgattgt gcttcacgtg 720 

agcaaccctg acggggtcga gctgatacat cgggcgcaat ccgagggcca ggacgtccac 780 

tgcgagtcgg gtccgcagta tctgaatatc accacggacg acgccgaacg aatcggaccg 840 

tatatgaagg tcgcgccgcc cgtccgctca gccgagatga acgtcagatt atgggaacaa 900 

cttgagaacg ggctcatcga cacccttggg tcagaccacg gcggacatcc tgtcgaggac 960 

aaagaacccg gctggaagga cgtgtggaaa gccggcaacg gtgcgctggg ccttgagaca 1020 
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tccctgccta tgatgctgac caacggagtg aataaaggca ggctatcctt ggaacgcctc 1080 

gtcgaggtga tgtgcgagaa acctgcgaag ctctttggca tctatccgca gaagggcacg 1140 

ctacaggttg gttccgacgc cgatctgctc atcctcgatc tggatattga caccaaagtg 1200 

gatgcctcgc agttccgatc cctgcataag tacagcccgt tcgacgggat gcccgtcacg 1260 

ggtgcaccgg ttctgacgat ggtgcgcgga acggtggtgg cagagaaggg agaagttctg 1320 

gtcgagcagg gattcggcca gttcgtcacc cgtcacgact acgaggcgtc gaagtga 1377 



(2) INFORMATION FOR SEQ ID NO: 14: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Met Phe Asp Val lie Val Lys Asn Cys Arg Met Val Ser Ser Asp Gly 
5 10 15 

He Thr Glu Ala Asp He Leu Val Lys Asp Gly Lys Val Ala Ala He 
20 25 30 

Ser Ser Asp Thr Ser Asp Val Glu Ala Ser Arg Thr He Asp Ala Gly 
35 40 45 

Gly Lys Phe Val Met Phe Gly Val Val Asp Glu His Val His He He 
50 55 60 

Asp Met Asp Leu Lys Asn Arg Tyr Gly Arg Phe Glu Leu Asp Ser Glu 
65 70 75 80 

Ser Ala Ala Val Gly Gly He Thr Thr He He Glu Met Pro Phe Thr 
85 90 95 

Phe Pro Pro Thr Thr Thr Leu Asp Ala Phe Leu Glu Lys Lys Lys Gin 
100 105 HO 

Ala Gly Gin Arg Leu Lys Val Asp Phe Ala Leu Tyr Gly Gly Gly Val 
115 120 125 

Pro Gly Asn Leu Pro Glu He Arg Lys Met His Asp Ala Gly Ala Val 
130 135 140 

Gly Phe Lys Ser Met Met Ala Ala Ser Val Pro Gly Met Phe Asp Ala 
145 150 155 160 
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Val Ser Asp Gly Glu Leu Phe Glu lie Phe Gin Glu He Ala Ala Cys 
165 170 175 

Gly Ser Val Ala Val Val His Ala Glu Asn Glu Thr He He Gin Ala 
180 185 190 

Leu Gin Lys Gin He Lys Ala Ala Gly Arg Lys Asp Met Ala Ala Tyr 
195 200 205 

Glu Ala Ser Gin Pro Val Phe Gin Glu Asn Glu Ala He Gin Arg Ala 
210 215 220 

Leu Leu Leu Gin Lys Glu Ala Gly Cys Arg Leu He Val Leu His Val 
225 230 235 240 

Ser Asn Pro Asp Gly Val Glu Leu lie His Arg Ala Gin Ser Glu Gly 
245 250 255 

Gin Asp Val His Cys Glu Ser Gly Pro Gin Tyr Leu Asn He Thr Thr 
260 265 270 

Asp Asp Ala Glu Arg He Gly Pro Tyr Met Lys Val Ala Pro Pro Val 
275 280 285 

Arg Ser Ala Glu Met Asn Val Arg Leu Trp Glu Gin Leu Glu Asn Gly 
290 295 300 

Leu He Asp Thr Leu Gly Ser Asp His Gly Gly His Pro Val* Glu Asp 
305 310 315 . 320 

Lys Glu Pro Gly Trp Lys Asp Val Trp Lys Ala Gly Asn Gly Ala Leu 
325 330 335 

Gly Leu Glu Thr Ser Leu Pro Met Met Leu Thr Asn Gly Val Asn Lys 
340 345 350 

Gly Arg Leu Ser Leu Glu Arg Leu Val Glu Val Met Cys Glu Lys Pro 
355 360 365 

Ala Lys Leu Phe Glu He Tyr Pro Gin Lys Gly Thr Leu Gin Val Gly 
370 375 380 

Ser Asp Ala Asp Leu Leu He Leu Asp Leu Asp He Asp Thr Lys Val 
385 390 395 400 

Asp Ala Ser Gin Phe Arg Ser Leu His Lys Tyr Ser Pro Phe Asp Gly 
405 410 415 

Met Pro Val Thr Gly Ala Pro Val Leu Thr Met Val Arg Gly Thr Val 
420 425 430 

Val Ala Glu Lys Gly Glu Val Leu Val Glu Gin Gly Phe Gly Gin Phe . 
435 440 445 



Val Thr Arg His Asp Tyr Glu Ala Ser Lys 
450 455 458 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
agaacatatg tttgacgtaa tagttaagaa 30 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
aaaaggatcc tcacttcgac gcctcgta 28 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
aggcgacata tgaccctgca gaaagcgcaa 30 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: nucleic acid 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
atgggatccc cggtcaagtg ccttcattac 
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